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Abstract 

The memoryless noncoherent single-input single-output (SISO) Rayleigh-fading channel is 
considered. Closed-form expressions for the mutual information between the output and the 
input of this channel when the input magnitude distribution is discrete and restricted to having 
two mass points are derived, and it is subsequently shown how these expressions can be used to 
obtain closed-form expressions for the capacity of this channel for signal to noise ratio (SNR) 
values of up to approximately dB, and a tight capacity lower bound for SNR values between 
dB and 10 dB. The expressions for the channel capacity and its lower bound are given as 
functions of a parameter which can be obtained via numerical root-finding algorithms. 

Index Terms 
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1 Introduction 

Wireless communication channels in which neither the transmitter nor the receiver possess any 
knowledge of the channel propagation coefficients (also known as noncoherent channels) have re- 
cently been receiving a considerable amount of attention [1-6]. Such channels arise whenever the 
channel coherence time is too short to obtain a reliable estimate of the propagation coefficients via 
the standard pilot symbol technique (high mobility wireless systems are a typical example of such 
a scenario). They are currently less well understood than coherent channels, in which the channel 
state is assumed to be known to the receiver (and sometimes also the transmitter). 

In this correspondence, we consider the memoryless noncoherent single-input single-output 
(SISO) Rayleigh-fading channel, which was studied under the assumption of an average power 
constrained input in e.g. [4-6]. In [4], Abou-Faycal et al. rigorously proved (in the average power 
constrained input case) that the magnitude of the capacity-achieving distribution is discrete with a 
finite number of mass points, one of these mass points being necessarily located at the origin (zero 
magnitude). Using numerical optimisation algorithms, the authors also empirically found that a 
magnitude distribution with two mass points achieves capacity at low signal to noise ratio (SNR) 
values, and that the required number of mass points to achieve capacity increases monotonically 
with the SNR. Numerical optimisation algorithms remain however the only way to find the number 
of mass points of the capacity-achieving magnitude distribution for a given SNR. 
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Another important reference on the average power constrained memoryless noncoherent SISO 
Rayleigh fading channel is the work of Taricco and Elia [5], where lower and upper capacity bounds 
were established, and it was also proved that for high SNR values the capacity only grows double- 
logarithmically in the SNR. The upper bound from [5] was subsequently tightened by Lapidoth 
and Moser [6] in the framework of a more general study on capacity bounds and multiple-antenna 
systems on flat-fading channels. 

The problem of finding the capacity-achieving magnitude distribution of the memoryless nonco- 
herent SISO Rayleigh-fading channel in the average power constrained input case can be solved for 
low SNR values by using numerical optimisation algorithms as in [4] , but the optimisation problem 
becomes intractable for high SNR values due to its high dimensionality. An additional difficulty 
arises due to the fact that closed-form solutions for the integrals appearing in the expression for 
the channel mutual information when the input magnitude distribution is discrete are not available 
in the literature. Consequently, numerical integration algorithms must be repeatedly used when 
performing such an optimisation. 

In this letter, we derive closed-form expressions for the mutual information of the noncoherent 
Rayleigh-fading channel when the input magnitude distribution is discrete and has two mass points, 
thus completely eliminating the need for numerical integration in order to compute the mutual 
information in such a case. For low SNR values (of up to approximately dB) and an average 
power constrained input, in which case the capacity- achieving magnitude distribution is discrete 
with two mass points, these closed-form expressions additionally enable us to write the channel 
capacity as a function of a single parameter which can be obtained via numerical root-finding 
techniques. This capacity expression also becomes a tight capacity lower bound when the SNR 
takes values between dB and 10 dB. 

Part of the material in this correspondence can also be found in [7,8], where an expression for 
the mutual information of the noncoherent Rayleigh-fading channel, when the input magnitude dis- 
tribution is discrete and restricted to having only two mass points, was presented in the framework 
of a study regarding the capacity region of a two-user multiple-access channel in which the channel 
state is known neither to the transmitters nor to the receiver. The most important additional 
contributions of this letter lie firstly in a fully detailed and rigorous proof of the validity of this 
expression, secondly in the derivation of alternative analytical expressions for the same quantity 
(which naturally follow from the structure of our proof), thirdly in the discussion of a special case 
in which the expression provided in [7, 8] can be simplified, fourthly in noting that the hypergeo- 
metric functions [9] appearing in the expression from [7, 8] can also be expressed in terms of the 
incomplete beta function [9], and fifthly in showing how to obtain analytical expressions for the 
derivative of the mutual information of the noncoherent Rayleigh-fading channel (when the input 
magnitude distribution is discrete with two mass points) with respect to parameters of interest, 
with applications to capacity calculations and the derivation of a capacity lower bound. 

The remainder of this letter is organised as follows: the channel model is introduced in Sec. El 
closed-form expressions for the mutual information when the channel input magnitude distribution 
is discrete and has two mass points being subsequently derived in Sec. El Applications to capacity 
calculations (for SNR values of up to approximately dB) and the derivation of a capacity lower 
bound (for SNR values between dB and 10 dB) in the average power constrained input case are 
then discussed in Sec. |U and conclusions finally drawn in Sec. [51 
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2 Channel Model 



We consider discrete-time memoryless noncoherent SISO Rayleigh fading channels of the form [4] 

V k = A k U k + W k , (1) 

where for each time instant k G N, G C, Uk G C, G C, and Wk G C respectively represent 
the channel fading coefficient, the transmitted symbol, the received symbol, and the channel noise. 
The elements of the sequences {A^} and {Wk} are assumed to be zero mean i.i.d. circularly 
symmetric complex Gaussian random variables with variances respectively equal to 1 and a 2 > 0, 
i.e. Ah ~ JVc(0, 1) and Wk ~ A/c(0, a 2 ). It is also assumed that the elements of the sequences 
{Ak} and {Wk} are mutually independent. The channel Q being stationary and memoryless, we 
henceforth omit the notation of the time index k. 

A and W being circularly symmetric complex Gaussian distributed, it follows that conditioned 
on a value u of U, the channel output V also is circularly symmetric complex Gaussian distributed, 
with mean value zero and variance |ti| 2 + <7 2 . Consequently, conditioned on the input U, the channel 
output V is distributed according to the law 



fv\u(v\u)= eH^. (2) 

Note that since fv\u( v \ u ) 1S independent of the phase argu of the input signal, the latter quantity 
cannot carry any information. If we now make the variable transformations 

yZp\> X,Y>0, (3) 

we obtain an equivalent channel, the probability density of the output Y of which, conditioned on 
a value x of its input X, is given by 

fy\x(y\x) = -^-^ X, Y > 0. (4) 

Let I(X;Y) denote the mutual information [10] between the input and output of the channel 
with transition probability (jljl. and let fx(x) and fy(y) respectively denote the channel input and 
output probability density functions. Note that since fv\u( v \ u ) is independent of arg it and argv 
we always have I(X;Y) = I(U;V), where I(U;V) denotes the mutual information between the 
input and output of the channel with transition probability © • 

3 Closed-Form Expressions For I(X; Y) When The Input Distri- 
bution Has Two Mass Points 

For a given input probability density function fx(x), the mutual information /(X; Y) of the channel 
with transition probability @ is given by 1 



f°° f'°° fv\x(y\x) 
I(X;Y)= / f Yl x(y\x)fx(x)log JY [ X ) y \ U ydx, (5) 

Jo Jo jY{y) 



1 In this letter, logarithms will be taken in base e, and mutual information will be measured in nats. 
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with fy{y) = Jo°° fv\x{y\ x )fx{x) dx. In this section, we derive a closed-form expression for I(X; Y) 
when the input probability density function fx(x) has the form 

fx(x) = ai5(x - xt) + a 2 5(x - x 2 ), (6) 

where 5(-) denotes the Dirac distribution, a\ and a 2 are constants which, fx(x) being a probability 
density function, must be such that < a±, a 2 < 1 and a\ + a 2 = 1, xi,x 2 > by virtue of the fact 
that X = \U\ > 0, and we assume without loss of generality that x\ < x 2 . 

We will in addition assume that < a\,a 2 < 1, that x\ ^ x 2 , and that x\ = 0. The reason for 
the first two assumptions is that the case where either a% = 0, a\ = 1, or x\ = x 2 is of little interest 
since fx (x) then reduces to a probability density function with only one mass point and the mutual 
information vanishes. The reason for assuming that x\ = is that - when the input probability 
distribution must satisfy an average power constraint of the form E [X 2 ~^ < P for a given power 
budget P - the capacity-achieving input distribution of the channel with transition probability 
fY\xiv\x) given in (|1J) always has one mass point located at the origin [4], and consequently the 
case x\ = is one of great practical importance. We however would like to point out that the case 
x\ ^ can be treated in exactly the same manner as below and that we choose not do so in order 
to keep all equations as simple as possible. 

With this choice for fx(x), the mutual information Q becomes 

I(X;Y) = -a.-a.log^-a.-a.logixlW)-^ T^T^^log (E^H^) d ^ 

k=1 Jo x k +a \i =1 x l +a J 

(7) 

The difficulty to find a closed-form expression for I(X; Y) lies in finding an expression for integrals 
of the form (with x £ {0,X2}) 

o„, -,,2 /„. _.,2 „_ -y 2 \ 

(8) 

which appear in 0. Integrals resembling J{x) do not appear in tables such as [9], and to the 
best of the authors' knowledge no closed-form expression is currently available in the literature. 
How to derive such closed- form expressions for J(x), which can then be used to obtain closed- form 
expressions for the mutual information I(X;Y) when the input probability density function fx(x) 
is of the form ©, is shown in Sec. 13. II below. 

3.1 Closed-Form Expressions for J(x) 
Let us define 

x\ + a 1 cH a\ x% + a 1 

which remembering the above assumptions can be seen to be always strictly positive, and let us in 
addition introduce the strictly positive quantity 

yl * -2fM+£f2 w (io) 

whenever log/3 < (i.e. when < /3 < 1). Note that when this is the case, we have 

~ 2 e^ = -^-^ (ID 
a 2 x\ + a 2 

for the y 2 > defined in (|1U|) . We now provide expressions for J(x) in three different cases. 
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3.1.1 Case I: a € {1,±,±,...} 
Noting that 



** + =£? + **zh*M i+ ^) ■ (l2> 

it is easy to see that J(x) = Jn(x) + J\2(x) + Jis(x), with 

r -v ^ x 2 +^ 2 

Jii{x)= -r^- — 2V 2 : — ^e^+^dy = — — -, (13) 

Jo (x 2 + a 2 )[x{ + a 2 ) x^ + a 2 

/ — £ x^+cr^ (^y — loff — 

^2 + 0-2 ^0 x2 + 0-2 x 2 + °" 

and 

Jis(x) = log (l + r'e^) = (l - (-/?)-«) log (1 + /T 1 ) - j£ 



fc(-/3) fe -« 
(15) 

The last equality is a direct consequence of the fact that, for any a, b > we have, omitting the 
integration constant, 

1/a 

= ^ -(-b) k -^e- kay2 / a - (e~ y2 l a + (-ft) - - log ft) log (l + be- ay2 l a ^ , (16) 
k=l 

which can be verified by differentiating the expression on the right-hand side of the equality sign 
with respect to y and remembering that for any q ^ 1 and any positive integer n, we have Ylk=i 1 k = 
(q-q n+1 )/(l-q). 

3.1.2 Case II: < (3 < 1 and a {1, .} 

In this case, we have J(x) = J2i(x) + J22(x) + ^23(2;) + J2a(x) + ^25(2;) + J2e(x), with 

y* ,.2 



2^ 

CT 2 (x 2 + (7 2 

ai [ y * 2y -zjI 



J ^= I ^u2^„2, e ^ d y> ( 17 ) 



J22W = log^i jl _^ e ^ dy , (18) 
hz{x) = / y . e^ 7 log 1 + /3e^T^ dy, (19) 



X 2 + O" 2 



,3 _„2 



2//" 

„, (x 2 + a 2 %4 + ^)' 



J ^ x ) = / 7Z2- -Khr—l ^e^dy, (20) 



j25 ^- io TOr^^^ < 21 > 



and 



J ™( x )= I x 2 2 + a 2 e ^ ^ ( 1 + P^e^ ) dy. (22) 
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This can be seen by remembering ()12|) . observing that we furthermore also have 

log ( ^ + ^~ 2 e^\ = =4 + ~, + f 1 + , (23) 

and by writing the integration interval in the expression (JHJ) for J(x) in the form [0, y*) U [y*, oo). 

2 / / 2 2 \ 1 

Noting that e~ v *'^ x +a > = it is straightforward to compute J2i(x), J22OE), J24(a0, and J25(^c) 
to obtain 



j al ( a .) = ^l( x a + a a + p Sf)_l_4 > (24) 



./ (i-^)log^l ; (25) 



•/->iU-) = -( g2 + ^ + a ^ ^, (26) 



and 



J - w = ( log ^) 3i (2T> 

We now turn our attention to J23(x) and J2q{x), which are more delicate to compute. To find an 
expression for J23{x), note that for all y € [0, y*) we have < f3e ay /( z +cr ) < 1. Therefore, the 
identity [9] 

log(l + a) = Y J [ -J—a k , (28) 
fe=l 

which is valid for — 1 < a < 1, can be used to obtain 

J2 3 (x) = / g log 1 + /?e^^ dy (29) 

io x -\- (J \ J 

rv* 2v -v 2 .. ^ f-l) fc+1 u fesi 



fc=i 



oo 



{—j3) k i °° (— l) fc 

where the order of integration and summation can be inverted in (|31[) by virtue of Lebesgue's 
dominated convergence theorem [11]. Indeed, defining for n £ {1, 2, . . .} 

MV) = ~Tlh e ^ E ^^/3 fc e (33) 

k=l 

we see referring to Lemma Q] in Appendix A that \<p n (y)\ < 4>\{y) for all y E [0, y*), and since 
Jq* 4>i(y)dy < oo the assumptions of Lebesgue's dominated convergence theorem are verified. 
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Evaluating the sums appearing in (|32|) then yields, remembering that < j3 < 1 and that 



a i {l,i i ...}, 



fe=i fe=i fc=i v 7 

= io g (i +p)- 2 f 1 (\^; + 3^2 (V&Ia; -1) , (35) 



where 



p ± q 



k=0 



denotes the generalised hypergeometric series [9, 12, 13], with (a) k = 0(0 + 1) • • • (a + k - 1) the 
Pochhammer symbol [9] and z G C. In the special case where p = 2 and (/ = 1, the above 
series reduces to the extensively studied Gauss hypergeometric series, which is often also denoted 
£2; Vii z ) [9)12,13]. Although the 3F2 appearing in can be reduced to a 2-P1 by splitting 
the second sum appearing in (|32|) into partial fractions before summing, we do not do so because 
subsequent simplifications in the final expression for J{x) then become less apparent (this remark 
also applies to (|37|) below). 

Using a similar procedure to evaluate J 2 q(x) yields 



I3t, /1,1,^±I 



J 26 (x) = ^--3^2 ' L+ Q i ; -1 ■ (37) 

Putting the above results together, we obtain after some elementary manipulations 



J(x) = -1 - - 2 + log °\ + log(l + 0) - 2 F 1 ( ;-p) 1 38 ) 

a z a z a — 1 



a 

1 /ll 2=1 \ 1 /ll 2£±I 



a — 1 V 2, — — y a + 1 V 2 ' 
which can be further simplified to obtain 

</(*) = -1 - 4 + lo S °\ + lo g(! + /5) " 2i?1 f^S 1 ; + (39) 

o - cH a - 1 V ~~ — J sm S 

as shown in Appendix B. 

Note that the reason for splitting the integration interval [0, 00) into [0, y*) U [y*, 00) and evalu- 
ating the resulting integrals in different ways is the fact that the series expansion (|2"8)l only is valid 
for — 1 < a < 1. 

3.1.3 Case III: (3 > 1 

We now have 

2 

a 2 ^27^2 , 01 zV 

e 2+ CT > _ e T^ (40) 



a; 2 , + cr 2 cr 2 



for all y > 0, and consequently J(x) can be evaluated by using (|12[) together with (|28j) over the 
whole integration range [0, 00). This yields, using the same technique as in Sec. 13.1.21 J(x) = 
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J3i{x) + J^{x) + J33(x), with Jsi(x) = Jn(x) and J^{x) = J12 (») respectively given in lfT3|) 
and (UU), and 



Jaa(x) = y e^log l + f^ dy (41) 

Jo x + CT V / 

= log(l + (3- 1 ) - ^ 2 iq -/T 1 ) • (42) 

J(x) hence reads 

JW " + log ^ + ^ + ') " £l 2Fl C# " r ') ' <43> 

We now give some comments regarding the expressions for J(x) obtained in (|39|) when < [3 < 1 
and a ^ {1, 5, |, . . .}, and in (|4*3|) when (3 >1. 

First of all, we recall that in the case when p = q + 1, the series p F q defined in (|36|) converges 
for 1 2: 1 < 1, also when z = 1 provided that Re(^r/ — > Oi an d when 2 = — 1 provided that 
Re ?? — X] £) > — 1 [12]- (In our case, when p = q + 1 = 2 and ^ 77 — X] £ = the series converges 
for z = — 1 as well as for \z\ < 1 , whereas when p = q + 1 = 3 and ^2v~^2C = 1 the series converges 
for z = ±1 as well as for \z\ < 1.) However, p +iF p can be extended to a single-valued analytic 
function of z on the domain C\(l, 00) [13-15], and it is common practise to use the symbol p +iF p to 
denote both the resulting generalised hypergeometric function and the generalised hyper geometric 
series on the right hand side of ()36|) which represents the function inside the unit circle. 

In Appendix C, we show that the functions G(z) : C — > C and G-z^z) ■ C — > C, which are 
respectively defined in ((75|) and Q1UUJI . are analytic for all z G C\(— 00, —1]. Although the proof we 
have is somewhat technical, the consequences are far-reaching. Indeed, if we consider the expression 
on the right hand side of (|41|) as a function of z = /3 _1 , then we immediately see that this function 
is analytic for all z E C\(— 00, —1] by comparing it to the definition of G(z) in l)75|) . Additionally, 
the discussion in the previous paragraph shows that the right hand side of H42[). when considered 
as a function of z = f3~ l (with log(-) denoting the principal branch of the logarithm [16], which is 
analytic on C\(— 00, 0)) also is analytic for all z € C\(— 00, —1]. It then follows from the theory of 
complex analysis [16] that equality between the right hand sides of (|41jl and 1)42(1 not only holds for 
/3 > 1, but actually whenever (3~ l E C\(— 00, —1], and thus in particular for all (3 > 0. 

When a £ {1,±,±,...}, we can consider the right hand side of (|29[) as a function of z = (3, 
compare it to the definition of G2(z) in (|1UU|) to see that it is analytic for all z G C\(— 00, —1], and 
conclude as above that equality with the expression in (|35|) is assured not only for < (3 < 1 but 
for all (3 G C\(— 00, —1], and thus whenever (3 > 0. We can also show, using the same arguments 
and working with G{z) in (|75|) . that when a ^ {1, |, |, . . .}, equality between the right hand sides 
of (|22]> and (|37]> not only holds for < (3 < 1 but actually for all (3 G C\(— 00,— 1], and hence 
whenever f3 > 0. We have therefore proved that, when a ^ {1,^,^,...}, (|39|) in fact holds for all 
[3 G C\(— 00, —1], and as a consequence in particular when (3 > 0. 

We also would like to discuss in some more detail the restriction a ^ {1,^,^,...} which was 
made in Sec. 13.1.21 We see that 

^ 2 Fi(\^;-/?) (44) 

is defined neither when a = 1 because of the factor nor when a G {|, |, • • •} because in such 
a case 2a ~ 1 is a negative integer or zero, \— -\ > | 2o! |, and then the hypergeometric function 2F1 
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itself is not defined [9, 13]. Nonetheless, if we let d belong to the set {1, 5, |, • • •}, we still have for 
all /3 > that 



a/3 ^ A,S£l \ ph „ A,l,«- J 



lim iog(i + /?) ^- 2 iq 2ck~-%l ; -/? + ^— r 3^2 ' LA ; -i 

a^rf a — 1 V — — / a — 1 V - 



2y 



„2 



-j/ 



lim / — - e x 2 +<? 2 log 1 + /3e^+°^ dy (45) 

a-*dj x 2 + a 2 V / 

2y -y 2 ., ( „ N 

_^ -e^+^ log 1 + (3e* 2 +° 2 dy < 00, (46) 

or + cH \ J 

where the exchange of the order of the limit and integration operations again follows from Lebesgue's 
dominated convergence theorem (it is easy to verify that the assumptions are met). The last integral 
can be evaluated using the identity 

/ l/d-l , . k 
^ e^>log (l + &<-*"/■) dy = e~^ a £ S^L e ^/a (47) 
a k=o d~ k 



■((-6)4 - e"^) log (l + 6e^/°) - - AT* 8 /- 



which is valid for any a, b > and d G {1, |, |,...} (this can be verified by differentiating the 
expression on the right-hand side of the equality sign with respect to y), and where the integration 
constant has been omitted. For convenience we therefore will consider (|35[) and (|39[) to be valid 
also when a G {1, |, ,-, . . .}, it being understood that the value at such points is to be obtained 
by a limiting process, and that any necessary numerical evaluations can for example be performed 
using relations such as (|47jl. 

We now point the reader's attention to the fact that by setting x = X2 in (|3*9*|) and (|43|). and 
noting that then a = x 2 ,/ " 2 as can be seen from Q, we obtain the continuation formula 

a/3 ^ (I,— „\ a/3" 1 „ A,— „ i\ 

a - 1 V ST ) sin- a + 1 V ) 

Bearing in mind the remarks in the previous paragraph, this is valid in particular when a, (3 > 0, 
the case we are interested in. (Evaluating integrals in more than one way is a technique that has 
already been successfully used to derive identities involving the generalised hypergeometric function 
3^2 [14,15].) 

To conclude this section, we note that the hypergeometric functions appearing in (|39|) and 1)43(1 
can also be expressed in terms of the incomplete beta function B z (a, b) by virtue of the relation [9] 

Fi( ] a,b -,z)=bz- b B z (b,l-a), (49) 



2r \b + r , 

which is valid whenever the left hand side is defined. 



3.2 Closed-Form Expressions for I(X; Y) 

In Sec. 13.11 a closed-form expression for J{x) was provided in (|13|) -(|15 |) for the case where a G 
{1, 2) 3) ■ ■ •} an d (3 > 0, and two closed-form expressions for J(x) were provided in (|39|) and (|43|) 
for the case where a, (3 > 0. 
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In order to obtain an expression for I(X;Y) which is valid in the general case where a,/3 > 0, 
the two integrals appearing in (J7J) must be evaluated using or (|i3~|) . There are four different 
possibilities for how to do this, each with its own advantages and disadvantages. For example, if one 
is seeking an analytical expression which does not present indeterminations that must be lifted by a 
limiting process for values of a 6 {1, |, g, . . .}, one should use (J1H|) to evaluate both integrals in 0. 
One might on the other hand be interested in an expression where the series representation (|36j) 
of the hyper geometric function always converges. One then should use (|39[) if (3 < 1, and (|43[) if 
(3 > 1 to evaluate the integrals in ((7J). 

Another criterion to decide when to use (|39|) and when to use l)43JI to evaluate the integrals 
in is the simplicity of the resulting expression for I(X;Y): simple analytical expressions will 
generally be easier to analyse and provide more insight than more complex ones. For example, if 
we choose to evaluate the integrals in by using and (|4*3~)) respectively when k = 1 and k = 2, 
we obtain the expression 

2 / 1 — a 

I(X; Y) = -(l- a 2 ) log(l - a 2 ) - a 2 log a 2 - (1 - a 2 ) log(l + (3) - 2 F X ( ' 2 ^ ■ -/? 



CT 2 

+d - <*)^J - <* icd + r ') + 2Fl 5 + J ; _ /r A , (5 „) 

sin ^t- CT V -2- + 2 / 

where we have used the relation a\ = 1 — a 2 to eliminate ai, and one recognises the entropy H(X) 
of the channel input X, which is distributed according to the law fx(x) given in Q. We do not 
list here all the possible expressions for I(X; Y) which arise from the different ways of evaluating 
the integrals in Q, but instead conclude this section with a discussion of the properties of I(X; Y) 
as x 2 — > oo which can be deduced from (JSUJ). 

If we let x 2 — > oo in (|50|) . one can immediately see that 



2 

-(1 - a 2 ) log(l +0)- 2 F X ( ^7 & ; -/?) 



-02, 



by remembering @ and ()36j) . By use of 0, (JHEJ), ()4<Sj) . and L'Hospital's rule, it furthermore follows 
that 

i+4 

7T/3 ^2 
(! - Q 2) . —2 ► «2 



and 



-2 log(l + /T 1 ) + (i^M ^ f + X ; - r 1 

x 2 



0, 



although some effort is required here. As x 2 — > oo, we thus see that 

I(X; Y) — -(1 - a 2 ) log(l - a 2 ) - a 2 log a 2 = (51) 
This can be intuitively understood if one remembers that the mutual information can be written [10] 

I(X;Y) = H(X) - H(X\Y), (52) 

with H(X\Y) the conditional entropy of the channel input X given the channel output Y. As 
x 2 — > oo, the mass points of the channel input distribution fx(x) grow more and more separated, 
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and it is to be expected that it will be more and more difficult to make a wrong decision on the value 
of the channel input X if the channel output Y were to be known, and hence that H(X\Y) — > as 
we just proved. 

Note in passing that we also have found an exact expression for H{X\Y) as can be seen by 
comparing (|5U|) with (|51|) and (|52|) . 



4 Maximum Attainable Mutual Information and Application to 
Capacity Calculations 

We now consider the calculation of the capacity of the channel with transition probability fY\x(y\ x ) 
given in © when the input X must satisfy the average power constraint 7?LY 2 ] < P for a given 
power budget P. Let the signal to noise ratio (SNR) then be defined by 

SNR 4 p/ a 2 , (53) 

where a 2 denotes the noise power (introduced in Sec. [2}. In this case, it has been proved [4] that the 
capacity-achieving probability density function fx( x ) is discrete and has a finite number of mass 
points, one of these mass points being located at the origin {x = 0). It has also been empirically 
found (by means of numerical simulations) that a probability density function fx(%) with two mass 
points achieves capacity for low SNR values, and that the required number of mass points to achieve 
capacity increases monotonically with the SNR. 

Referring to the results from [4], we see that a distribution with two mass points achieves 
capacity for SNR values below approximately dB, and that the maximum mutual information 
that can be achieved by using such an input distribution is within 0.02 nats of the channel capacity 
whenever SNR < 10 dB. 

Following [4], let x\ = and x\ = P/d2 in fx(x) (see ©)• For any given power budget 
P, the input probability density function fx(x) is thus uniquely determined by the value of a<i- 
Furthermore, let 2 7 a2 (SNR) denote the value taken by the mutual information I(X;Y) for given 
values of 02, P, and a 2 (with x\ = P/a2 in ©), and let 

/*(SNR) = max 7 a2 (SNR) (54) 

0<(I2<1 

and 

a2(SNR) = arg max I a2 (SNR). (55) 

0<Q2<1 

7* (SNR) hence corresponds to the maximum attainable value of the mutual information I(X;Y) 
when the input probability density function fx(x) has two mass points, and a^SNR) corresponds 
to the probability of the nonzero mass point for which 7 a2 (SNR) = I* (SNR). The channel capacity 
C(SNR) is thus equal to 7* (SNR) for SNR values below approximately dB, and exceeds 7* (SNR) 
by less than 0.02 nats whenever SNR < 10 dB. 

Although a^SNR) can be obtained by solving a one-dimensional optimisation problem, it must 
also satisfy the equation 

37 a2 (SNR) 



dao 



0. (56) 



2 Note by referring to © and ll5Ull that I a2 (SNR) only depends on P and a 2 via the ratio SNR = P/a 2 
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An expression for dl a2 /da2 can be obtained by noting that 



da \l + hi(a) ) da l + hi(a) \ 2 + hi (a) ) 

dhxja) h 2 (a) (2, 1 + hi (a), 1 + hi (a) , , . 

+ da (l + h!(a)) 2 3 2 V 2 + h 1 (a),2 + h 1 (a) ,ft2W 

for any given differentiate functions hi(-) and h 2 (-) whenever the left hand-side is defined, and 
using this expression together with the expression for I(X; Y) that is obtained by evaluating both 
integrals in (JJJ) with the help of (|43|) . (Relation (|57|) can be established for values of h2(-) inside 
the unit circle by using the series representation (|36|) . and subsequently for all other values of 
fi2(-) G C\(l,oo) by using analytic continuation arguments.) 3 

It is clear that (|56j) must be satisfied by at least one value < 02 < 1: this can be seen by 
observing that I a2 (SNR) = 0, when a 2 = or a 2 = 1, and that I a2 (SNR) > when < a 2 < 1. 
Indeed, I a2 (SNR) being a continuous and differentiable function of a2 in this range, it follows that 
there is at least one value < a 2 < 1 which satisfies (|56|). Moreover, numerical experiments also 
seem to indicate that the solution of (|56|) always is unique although we presently do not have a 
proof of this property. 

For SNR values of up to approximately dB, we have thus found a closed- form expression 
for the capacity C(SNR) of the channel with transition probability (JIJ) as a function of a single 
parameter which can be obtained via numerical root-finding algorithms. 

Figs, n and El respectively show the maximum attainable mutual information /* 2 (SNR) when 
the input probability density function fx(x) is discrete and restricted to having two mass points, 
and the corresponding optimal probability a?! (SNR) of the nonzero mass point as a function of the 
SNR. These figures were obtained by numerically solving Q56JI for SNR values between — 10 dB and 
30 dB, which proved to be by far the simplest method. In Fig. ^ we have also plotted the constant 
log 2 = 0.693 ... for reference, and it can be seen as expected that limsNR^oo Ia 2 (SNR) = log 2, and 
that lim S NR->oo a^SNR) = \. 

Fig. Olshows the mutual information I a2 (SNR) as a function of the probability a 2 of the nonzero 
mass point when SNR € {— 5dB,5dB}. It is interesting to observe that I a2 (SNR) ceases to be 
concave in a 2 for sufficiently low SNR values, and that as SNR — > 00, the figure suggests that 
I a2 (SNR) indeed approaches the entropy H(X) of the channel input X as also mentioned earlier. 4 
Note additionally that in both cases shown in the figure the equation <9I a2 (SNR)/<9a2 = only has 
one solution a2 € (0, 1). 

5 Conclusions 

In this correspondence, we considered the memoryless noncoherent SISO Rayleigh-fading channel. 
We first derived closed-form expressions for the mutual information between the output and input 
signals of this channel when the input magnitude distribution is discrete and restricted to having 
two mass points, and subsequently showed how these expressions can be used to derive closed-form 
expressions for the capacity of this channel for SNR values below approximately dB, and a tight 
capacity lower bound for SNR values between dB and 10 dB. The expressions for the channel 



3 Note that l|57|l also could be used to obtain expressions for e.g. dl a2 (SNR) / d SNR, or for the derivative of I(X; Y) 
(when the distribution of X is discrete and has two mass points) with respect to any other parameter of interest. 

4 Note that since X is distributed according to the law fx(x) given in ©, H(X) is nothing more than the binary 
entropy function [nats], which attains its maximum value of log 2 = 0.693 . . . when a\ — a-z = |. 



12 



£ 0.7^ 




2 1 1 * 1 1 

-10 10 20 30 

SNR [dB] 



Figure 1: Maximum attainable mutual information I* 2 (SNR) [nats] as a function of the SNR [dB] 
when the input probability density function fx(x) is discrete and restricted to having two mass 
points. The constant log 2 = 0.693 . . . has also been plotted for reference (dashed line). 




-10 10 20 30 

SNR [dB] 



Figure 2: Optimal probability a^SNR) of the nonzero mass point as a function of the SNR [dB]. 



13 




Figure 3: Mutual Information I a2 (SNR) [nats] as a function of the probability 02 of the nonzero 
mass point for SNR € {—5 dB, 5 dB}. 



capacity and its lower bound are given as functions of a parameter which can be obtained via 
numerical root-finding algorithms. 



Appendix A 

In this appendix, we state and prove a lemma which will be used to show that it is legitimate to 
exchange the order of integration and summation in (|31|). 

Lemma 1. Let n be a positive integer, let < q < I, and let T n = Y^k=i ^~~~F — ^ ' ' Then 
<T n < q for all n. 

Proof. Let n and p be positive integers. The result is trivial if q = 0. We thus assume < q < 1 in 
the remainder of this proof. For all such q, we always have T\ = q, and T 2n < Tzn+i- The lemma 
will thus be proved if we show that < Tm and Ti n +\ < q for any positive integer n. Now, noting 
that 

f qP +1 _ q p qP 
p p+1 p p p + 1 

we see that T 2n = Y^k=o c 2fc+i > an d tnat T 2n +i = q - Yl=i c 2fc < Q- □ 



a Q <T T <t q y qp _ . , 

c p = — = — > 0, (58) 
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Appendix B 

In this appendix, we show how the expression for J(x) given in (|38jl can be reduced to the simpler 
equivalent expression given in (|39|) . Let 

1 /i i a ~ 1 \ 1 /i i "+ 1 

S = ° + ( 2, i# - 1 ) + ^TT3F 2 ( • ^ ;-l 



a 



^fc(fea-l) £-^k(ka + l) 
+ S (2fc + 1)((2A; + l)a - 1) ~ ^ 



fc= 
oo 



; _ Q (2fc + l)((2fc + l)a - 1) ^ 2fc(2fca - 1) 



A 



x i x i 

+ ^ (2fc + 1)((2A; + l)a + 1) ~ 2 2H2fca + 1) 

fc=0 k=l 

a + S!-S 2 + S 3 - Si. (59) 



with obvious definitions for Si, S2, S3, and S4, and the assumptions of Sec l3.1.2l still applying. These 
sums can be evaluated in terms of the psi function %[){■) [9], which admits the series representation 

00 

M = -, + E (t + 1)( « +1 + ^ (00) 

00 

with 7 = 0.577. . . denoting Euler's constant [9], as can easily be deduced from [9, 8.362 (1)]. This 
yields 

Sl = _2_ log2 _^(?__i), (62) 
S 3 = l+log2 + ^f^±i), (64) 

and 

Sa = , 

2 2 7 V2a< 

Here, (|6*2*|) can be verified by setting q = — ^- — | in (|6*U|) to evaluate Si + ^ + oVKlsr)' an d rioting 
that 

log2 = (l-i) + (i-i) + (i-i) + --- (66) 
00 j 

= S ( 2 fc + l)(2fc + 2)' (67) 

with (|66|) following from (|28|) at a = 1; (|63|) follows directly by setting g = — ^ in (|61j): (|64|) can 
be obtained by using (|EU|) with g = ^- — | to evaluate ^ + fV'C^ir) ~~ ^3 an d remembering (f&Tj) ; 
and (|63|) can be proved by setting q = 5- — 1 in the identity 

00 

^( g + l) = - 7 + ^- + ^ g (68) 



2 2 T V 2a 
5 4 = J+a + i^fi-). «>*) 
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which can easily be obtained starting from (|60|). to find an expression for ^ + ^VK^j) +a — S4 and 
observing that 

00 

1 = ( 1 -^) + a-3) + (|-i) + --- = E^T)- ( 69 ) 
Combining ((SHI) with $2$-$5§ yields 

^ = \ W -k)- + ^ + &) - *(h - M ( 7 °) 

= £[7rcot£+7rtan£] (71) 

(72) 



sm ■ 



where (j71j) follows from the relations [9] 

ifj(l — q) = ifj(q) + 7T cot irq (73) 
i>(l + = ^(3 - q) + n tan irq. (74) 

The simplification in (|38j) now follows immediately from (|59|) and (j72[) . 

Appendix C 

In this appendix, we show that the functions G(z) and (^(z), defined respectively in (|75|) and (jlUUj) . 
are analytic for all z G C\(— 00, —1]. This fact is used in Sec. 13. II to prove additional properties of 
the expressions for J(x) given in (f3U|) and (|4*3*|) . 

Let us consider the function G : C — > C defined by 

/oo , . 

2ay e^ 2 log (l + z e"^) dy, (75) 

with a, 6 positive real constants, c a nonnegative real constant, and where log(-) denotes the principal 
branch of the logarithm [16]. 

To prove that G{z) is analytic for all z € C\(— 00, —1], consider the function g : C x R + — > C 
defined by 

g(z, y) = lay a"** log ( 1 + z e~ b A . (76) 



For any fixed y it is clear that g(z,y), when considered as a function of z only, is analytic on the 
domain z G C\( — 00, —1] since the principal branch of the logarithm is analytic on C\(— 00, 0]. It 
hence satisfies the Cauchy-Riemann equations (which are a necessary and sufficient condition for a 
complex function to be analytic) [16] 

^l = ^!l and ^£ = / 77 ) 

dz r dzi dzi dz r 

for all z £ C\(— 00, —1], where u g (z r , Zi,y) = Reg(z r + izi, y) and v g (z r , Zi, y) = Img(z r + izi,y). 

In order to show that G(z) is analytic on C\(— 00, —1], we will show that the Cauchy-Riemann 
equations 

du G = dvc_ duc_ _ _^G_ ^ 

dz r dzi dzi dz r 
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are satisfied for all z E C\(— oo, —1], where uq(z t , z; L ) = Re G(z r + iz.j) and va(z r , %i) = Im G(z r + 
zzi). (We will sometimes write uq(z) instead of ua(z r , Zi), vg(z) instead of vc(z r , zi), u g (z,y) 
instead of u g (z r , Zj,y), and v g (z,y) instead of v g (z r , Zi,y) for convenience.) Since obviously 

poo poo 

u G (z) = u g (z,y)dy and v G (z) = / v g (z,y)dy, (79) 

J c J c 

it is sufficient to show that 

together with similar conditions for 

d f°° d f°° d f°° 

— / u g (z,y)dy, — / v g {z,y)dy, — / v g (z,y)dy (81) 
by virtue of (|77|l. In order to do so, we will use the following result [11]: 

Theorem 1. Let (y, A, fi) (with y a set, A a a -algebra on y, and fi a measure on A) be a measure 
space, let (bo,b\) be a (possibly infinite) interval of M., and let h : (6q,6i) x y — > R U oo. For all 
x € (bo, bi), let 

J(x)= [ h{x,y)dfi (82) 



jy 

which is assumed to be always finite. Let V be a neighbourhood of x* £ (&o>&i) such that: 
(i) For almost all y &y, h(x,y) is continuously differ entiable with respect to x on V. 
(ii) There exists an integrable function TL : y — > M U oo such that for all x € V 

dh 



dx^ v) 



Then JT(x) is differentiate at x* , and 

dj(x* 



< H(y) almost everywhere. (83) 



tf^-.y)^ (84) 



d.r 

Proof. See standard texts on Lebesgue integration such as [11]. □ 

In our case, y is the interval [c, oo), A is the Borel u-algebra on y, and \i is the standard 
Lebesgue measure on A. The function h(z r ,y) = u g (z r , Zi,y) with Zi fixed when proving (|8Uj). and 
similar definitions when proving the conditions in l|81j) . 

Now, for the g(z,y) defined in (|76|) . we have 

2 i 2 

%(z r , Zj, y) = 2aye~ ay log 1 + (z r + zzj)e " 

= aye^ 2 log ((l + z^f + (z^ 2 ) 2 ) , (85) 

and hence 

fin, 9.ai,p-( a + b )y 2 h 4- ^p" 6 ^ r)„ 

(86) 



du g _ 2m/e-( a +^ (1 + z r e~^ ) _ dvg_ 
9z r (l + Zj , e -by^ 2 + ( Zie -by 2 Y dzi 
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In the sequel, we will need to upper bound the absolute value of (|5H|) in different settings. This 
will always be accomplished by obtaining an upper bound M < oo for |l + z r e~ by | and a lower 

bound m > for | (l + z r eT h y 2 ) 2 + (^e"^ 2 ) 2 ). 

Let z* and z% be such that z* + izi G C\(— oo, —1], and let the neighbourhood V of z* be defined 

by 

{ {z*-e,z* r +e) if Zi = and z* G (-1, oo) 

\ 0*-M* + i) if^o, 1 j 

with e = > when z% = and z* G (— 1, oo). Then, 

\l + z r e- hy2 \ < l + \z r \ < 1 + \z*\ +max{l,e} = M, (88) 
for any z, E V, any given Zj, and any y G [c, oo); whereas if z% = 0, 
(1 + z r e"^ 2 ) 2 + {z^f = (1 + z r e"^ 2 ) 2 

> (1 + ^pV^ 2 ) 2 

>min|l,(l + ^) 2 } 4 mi (89) 
for any z r £V and any y G [c, oo), and if z% 7^ we have for any z r G V and any y G [c, 00) 

(l + z r e-^ 2 ) 2 + (^e-^ 2 ) 2 

> min { (1 + z r e- hc2 f + {z^f, 1, ^} 

> min { ( Zl e- bc2 ) 2 , 1, max{(z ,_ 1) 2^ +1)2}+2 J 

= ma, (90) 

as one can see by noting that 

inf (1 + Zr e~ b y 2 ) 2 + ( Zi e- b y 2 ) 2 (91) 

2/G[c,oo) 

must either be attained when y = c, when y — » 00, or when 

^{(l + z r e-^ 2 ) 2 + (^ 2 ) 2 } = 0. (92) 
Consequently, if we set m = min{mi, 1712} > 0, we see that for all z r G V and any y G [c, 00), 



< 2M °^-' J+% ', (93) 
m 



which is integrable on [c, 00) for any a,b,m, M > and any c > 0. Noting in addition that for any 
z r G V and any Zj, given in (|86|) is continuous in z r , we have proved that (|80|) holds for all 
z = z* + izi G C\(— 00, —1] by virtue of Theorem^ 

Similarly, let z r and 2* be such that z r + iz* G C\(— 00, —1], and let the neighbourhood V of z* 
be defined by 

f (Z* - 1, Z* + 1) if Z r G (-1, OO) 

1 (zf - e, z* + e) if z r G (-00, -1] and z* + 0, K } 
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with e = > when z* ^ 0. Then, 

|l + z r e~ by2 \ < 1 + \z r \ = M', (95) 
for any Z{ G V and any y G [c, oo); whereas if z r G ( — 1, oo), 

(1 + z r e-*>y + (z^ 2 ) 2 > (1 + Zre-^f 

> minjl, (1 + z r ) 2 | = mi, (96) 

for all Zj € V and any y G [c, oo), and if z* ^ 

(1 + Zr€ -»yy + ( Zie ~^ 2 ) 2 

> min { (1 + z r e- bc2 f + {z^f, 1, ^} 

> mini ((7* r)p~ bc2 ) 2 1 m M(<-g) 2 ,(<+g) 2 } 1 

A m' 2 , (97) 



for all 2j G V and any y G [c, oo). Consequently, if we set m' = minjmi, m^} > 0, we see that for 
all Zi G V and any y G [c, oo), 



(z r , Zj, y) 

OZi 



2M'aye-^ a+b ^ 2 , . 

< ^3 . (98) 



which is integrable on [c, oo) for any a, 6, m! , M' > and any c > 0. Noting in addition that 
for any z% G V and any z r , ^7 given in (J86|) is continuous in Zi, we have proved that the order 
of integration and derivation can be exchanged in the last expression appearing in (|81|) for all 
z = z r + iz* G C\(— 00, — 1]. 
Noting now that 



du g _ 2aye- { - a+ Vy {zjeT^ ) _dv R 
dz i (1 + z r e- b y 2 ) 2 + (zie-^ 2 ) 2 9z r ' 



(99) 



it is easy to transpose the above arguments to prove that exchanging the order of differentiation and 
integration is legitimate for all z G C\(— 00, —1] also in the case of the first and second expressions 
appearing in (|81|). and we have thus established the analytic character of the function G(z) defined 
in (f75|) on the domain C\(— 00, —1]. 

Let us now consider the function G2 ■ C — > C defined by 

G 2 (z) = J\aye- ay2 \og(l + ze- hy2 )dy, (100) 

with a a positive real constant, b G M., and c a nonnegative real constant. The arguments from 
this appendix can be used verbatim to prove that G%(z) also is analytic on C\(— 00, —1] - and the 
proof can even be simplified due to the fact that the integration interval [0, c) now being finite, it is 
sufficient that the corresponding functions u g2 (z r , Zi,y) and v g2 (z r , Zi,y) be differentiable functions 
of z r and z% on C\(— 00, —1] for all y G [0, c) to guarantee that the order in which integration and 
differentiation are performed in 1)80(1 and in the expressions given in (|81|) (with u g (z, y) and v g (z, y) 
respectively replaced by u g2 (z, y) and v g2 (z, y), and the integration interval [c, 00) replaced by [0, c)) 
can be exchanged [9, Sec. 12.211]. 
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